Exercises

Advance Analytics with R (UG 21-24)

Ayush Patel

Before we start

Please load the following packages

library(tidyverse)
library(MASS)
library(ISLR)
library(ISLR2)
library(nnet)### get this if you don't
library(e1071) ## get this if you don't
library(modeldata)## get this if you don't



Access lecture slide from bit.ly/aar-ug

Warrior's armor(gusoku)
Source: Armor (Gusoku)

Hello

I am Ayush.

I am a researcher working at the intersection of data, law, development and economics.

I teach Data Science using R at Gokhale Institute of Politics and Economics

I am a RStudio (Posit) certified tidyverse Instructor.

I am a Researcher at Oxford Poverty and Human development Initiative (OPHI), at the University of Oxford.

Reach me

ayush.ap58@gmail.com

ayush.patel@gipe.ac.in

Exercise 1

  • See the Sacremento data in package modeldata.
  • Carry out Exploratory analysis to see variables that are associated with price of a house.
  • Train a model that will predict prices of houses.
  • Carry out model diagnostics. Make necessary changes to the model.
  • Check model performance.

Exercise 2

  • See the attrition data in modeldata.
  • Explore relation of variables with attrition.
  • Create a model that explains attrition in relation with available variables.
  • How well does the model perform?

Exercise 3

  • See the hpc_data data in modeldata.
  • Create a model to predict the class variable.
  • Calculate training and validation error rates.
  • Check model performance.

Exercise 4

How would you estimate average number of twitter followers of a All twitter using Bsc students at Gokhale Institute of Economics and Politics?

The only condition is that you cannot talk to more than 40 randomly selected twitter using BSc students at Gokhale Institute of Economics and Politics.